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Design of Binary Network Codes for 
Multi-user Multi-way Relay Networks 

Ang Yang, Zesong Fei, Chengwen Xing, Ming Xiao, Jinhong Yuan, and 

Jingming Kuang 

Abstract 

We study multi-user multi-way relay networks where N user nodes exchange their information 
through a single relay node. We use network coding in the relay to increase the throughput. Due to 
the limitation of complexity, we only consider the binary multi-user network coding (BMNC) in the 
relay. We study BMNC matrix (in GF(2)) and propose several design criteria on the BMNC matrix to 
improve the symbol error probability (SEP) performance. Closed-form expressions of the SEP of the 
system are provided. Moreover, an upper bound of the SEP is also proposed to provide further insights 
on system performance. Then BMNC matrices are designed to minimize the error probabihties. 

Index Terms 

A^-way relay, binary network coding, symbol error probability. 

I. Introduction 

Network coding (NC) is considered as a potentially powerful tool for efficient information 
transmission in wireless networks, where data flows coming from multiple sources or to different 
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sinks are combined to increase throughput, reduce delay, or enhance robustness [IIl-[|3l. Consider 
a two-way wireless system where two source nodes communicate with each other through the aid 
of one relay node [ffl-dSl. With network coding, each of the two transceivers employs one time 
slot to transmit a packet to the relay in a conventional time-division multiple access (TDMA) 
scheme. Next, the relay takes the exclusive-or of these two packets and broadcasts the result 
during the third time slot. Armed with the packet it sent to the relay, each of the transceivers can 
then recover the data originating at the other relay, with the network having only used three slots 
rather than the traditional four. In what follows, we shall extend NC to the multi-user, multi-hop, 
multi-relay and multi-radio wireless ad hoc networks, which are introduced in [|9l- [fr3l . 

As previous related work, the sink bit error probability (BEP) for the coded network with 
memory less and independent channels is investigated in [|T4ll . The alphabet size of the code is 
GF(2™). In [[TSll . finite-field network coding (FFNC) is designed for multiple-user multiple-relay 
(MUMR) wireless networks with quasi-static fading channels. For high rate regions, FFNC has 
significantly better performance than superposition coding. In [fT6ll , using code division multiple 
access (CDMA) of an interference limited system, a jointly demodulate- and-XOR forward (JD- 
XOR-F) relaying scheme is proposed, where all users transmit to the relay simultaneously 
followed by the relay broadcasting an estimate of the XORed symbol for each user pair. The 
problem of joint resource allocation for OFDMA assisted two-way relay system is studied in [17] 
and the objective function is to maximize the sum-rate through joint subcarrier allocation, sub- 
carrier pairing, and power allocation, under the individual power constraints at each transmitting 
node. Several beamforming schemes are proposed in [TS^ for the scenario where multiple pairs 
of users exchange information within pair, with the help of a dedicated multi-antenna relay. A 
cooperation protocol based on complex-field wireless network coding is developed in a network 
with N sources and one destination [19|. To deal with decoding errors at sources, selective- and 
adaptive-forwarding protocols are also developed at no loss of diversity gain. For the multiple- 
access relay network, the capacity approaching behavior of the joint network LDPC code is 
analyzed in [|20]|, [[2l]|. 

Above literatures focus on the information exchange of multiple pairs of users with or without 
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the assistance of one relay node. For more general cases, in a practical network, there are 
multiple relays or multiple hops. In [|22l . new approaches to LDPC code design for a multi-source 
single-relay FDMA system are explored, under the assumption of uniform phase-fading Gaussian 
channels. In [23], a binary field NC design over a multiple- source multiple-relay wireless network 
over slow-fading channels is studied. In [24], a novel scheme of multi-channel/interface network 
coding is proposed, which is based on the combination of a new concept of coded-overhearing 
and coding-aware channel assignment. In [1251 . the power allocation policies are investigated 
across the relays for automatic gain control (AGC)-based amplify-and-forward (AF) distributed 
space-time code (DSTC) systems in the two-way relay networks. In [26], with a new flow-based 
characterization of pairwise intersession network coding, an optimal joint coding, scheduling, 
and rate-control scheme can be devised and implemented using only the binary XOR operation. 
In GTll . a novel concept of wireless network cocast (WNC) [28] is considered and its associated 
space-time network codes (STNCs) are proposed to achieve the foretold objectives. However, 
CDMA-like, FDMA-like and TDMA-like techniques are proposed in ||23, [|28]|, where each 
symbol is assigned a complex- valued signature waveform, the dedicated carrier and the symbol 
duration. In [29], several interesting properties of network coding matrices are discussed in a 
network where users have independent information to send to a common base station. 

In [301, it has been shown that for the A^-way single-channel relay network, it takes at least 
(2A^ — 1) time slots for the linear NC scheme without opportunistic listening to perform a round 
of the A^-way relay, where there are A^, with N > 2, end nodes exchanging their information 
with the assistant of one A^-way relay with single antenna. However, [30] focus on a general 
linear programming framework for solving the throughput optimization problems and a joint 
link scheduling, channel assignment, and routing algorithm for the wireless NC schemes to 
closely approximate the optimal solutions. The detailed linear NC for A^-way single-channel 
relay networks, such as how A^ information packets are encoded into A^ — 1 pronumerals, is not 
investigated in [|30ll . 

In this paper, we take a step further to investigate the efficient linear NC for A^-way single- 
channel relay network, which is also discussed in [|30l . As shown in ||23l , in the case that 
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the network size (i.e., the number of the sources and the number of the relays) and the frame 
length (i.e., the number of symbols or Galois filed elements in a frame) are large, we need to 
choose a large size of the Galois fields. Therefore, the encoding complexity of the GF(q) codes 
will significantly increase. Since binary network coding is of low complexity, binary multi-user 
network coding (BMNC) is considered here to increase throughput. Several design criteria that 
the BMNC matrix should follow to increase the system performance are provided. Moreover, 
the effects of the noise and the BMNC matrix are studied, based on which the symbol error 
probability (SER) of the system is provided. To improve the system performance further, BMNC 
matrices are designed for arbitrary number of users, which minimize the bound of SEP. 

The paper is organized as follows. In Section II, the system model is introduced. In Section III, 
BMNC decoding process and a design criterion on the BMNC matrix are presented. Performance 
analysis is shown in Section IV, which includes BMNC matrix analysis, closed-form expressions 
of the SEP and throughput of the system, the tight upper bound of SEP of the system and the 
designed BMNC encoding matrix. In Section V, the optimality of the matrix given in Section 
IV-D is discussed. Simulation results are presented in Section VI and the conclusions are given 
in Section VII. 

Some notations are listed as follows. Symbol (A)^^. presents the element of matrix 

A. Symbols ®, J] ® denote the addition and the summation in GF(2), respectively. Symbol o 
presents element-wise product and (A o B).^. = (A)j^ (^)ij- Symbol Y[° presents the element- 
wise product of multiple matrices or vectors. 

II. System Model 

Consider a wireless network with N user nodes C/j, i — 1,. . . ,N, and one relay node R, 
as shown in Fig. 1. Each node has only one antenna, which can be used for both transmission 
and reception. For practical services such as video conference in which each user may want to 
have a discussion with the other users, the N users need the information of other users and they 
exchange information with the assistance of the relay R. Without loss of generality, in one time 
slot, the exchanged information bit of Ui can be denoted by Xj, 1 < i < N. Whereas, in practice. 



the user nodes and the relay will transmit the information in packets that contains a large number 
of symbols. The user nodes will collect all the transmitted packets and then jointly detect them. 
We assume that the direct links between the users are not available. All communications must 
be through the relay. 

Take Ui for example, it needs the information from the other N — 1 users, while the other 
A'^ — 1 users need the information of Ui. In the traditional scheme, considering the time division 
transmission schemes, the traditional scheme needs 2N time slots to finish the information 
exchange, where N time slots are used for the relay to receive the N information bits of the N 
user nodes and the other N time slots are used for the user nodes to receive the N information 
bits from the relay. 

In order to improve the system performance, we propose a BMNC scheme, in which only 
2N — 1 time slots are used. In this scheme, the transmission can be divided into two consecutive 
phases. 1) In the source transmission phase, each user node sends its own information to the 
relay node, which takes N time slots. The relay receives and then detects the N information bits 
from the N users. 2) In the relay transmission phase, the relay linearly combines the detected 
information bits, and then broadcasts the combined information bits to all the users. Since each 
user knows its own information, only the information bits of other — 1 users are needed. Thus, 
at least N — 1 information bits should be broadcasted from the relay to all the users. Finally, 
the BMNC scheme takes 2N — 1 time slots to achieve the information exchange. 

In the source transmission phase, the received symbols at the relay node are 

yo - HoM(x) + No, (1) 



where yo 



yo,i yo,2 ■■■ yo,N 



T 



denotes the received signals at the relay, Hq = diag{ 



ho,i: ^0,2, ■ ■ ■ , ^o,Ar} denotes the fading coefficients, x = 



-iT 



Xi Xi ... xat 



denotes the bits 



of the users, M(.) denotes the modulation transformation and M(x) denotes the transmitted 

T 

denotes the additive white Gaussian 



symbols of the users, and No — 
noise (AWGN) with zero mean. 



Then the relay detects the received information and obtains an estimation of the source bits 

1 T 



Xi X2 



Xn 



. Then linearly network coding is proposed to combine information 



bits into — 1 bits, which can be shown as follows 



(Fx) mod (2) = r, 



(2) 



denotes 



where F is the network encoding matrix in GF(2), vector r = n r2 ■ ■ ■ r^-i 
the — 1 information bits that the relay will broadcast. Note that rj is the information bit to be 
transmitted in time slot i, i E \l,N — \]. The network encoding matrix F can be described as 



/l,2 

/2,1 /2,2 

/iV-1,1 fN-1,2 
f 1 f2 ■ ■ ■ f AT 



fl,N 
f2,N 

N-l,N 



(3) 



{N-l)xN 



J {N-l)xN 

where fj^i is one element of the network encoding matrix in GF(2) for j G [1, — 1] , i G [1, A^], 



fl,i f'. 



2,i 



N-IA 



which is related to Ui and the jth symbol that the relay sends. Vector fj - 
is the iih. column vector of F, which denotes the relationship between Ui and the A^ — 1 symbols 
that the relay sends. 

The symbol GF(2) is referred to the Galois filed of two elements [|3TI . In our work, it consists 
of and 1. Over GF(2), many well-known but important properties of traditional number systems, 
such as real number, rational number etc., are retained: addition has an identity element and an 
inverse for every element; multiplication has an identity element "1" and an inverse for every 
element but "0"; addition and multiplication are commutative and associative; multiplication is 
distributive over addition flST). 

In the relay transmission phase, the information that Ui receives is 



y, = H,M(r) + N„ 



(4) 



where 



n T 



yi,i yi,2 



denotes the signals that Ui receives, Hj = diag{ 



hi^i, hi^2, ■ ■ ■ , hi^N-i} denotes the fading coefficients, and Nj 



denotes the AWGN with zero mean. Then the relay detects the received information and 

1 T 



obtains r. 



Finally Ui decodes the information of other users 



through fj, F, Xj. 



III. BMNC Decoding Process 

As discussed above, the relay needs to broadcast at least — 1 coded bits. However, arbitrary 
encoding may cause some users can not decode the source information bits even though there 
is no noise in the system. Thus, the network coding process should be designed carefully. 

Clearly, Ui only knows its own information Xi, the A^ — 1 bits fj that it detects from the 
received information and the network coding matrix F. Then we investigate the relationship 
between Xi, fj and F. From we have 

N 

k=l 
N 

k=l 



(5) 



N 

-l,kXk 

L k=l -1 (Ar_i)xl 

Separating the information of Ui and other users, the above equation can be rewritten as 

N 

E ®fl,kXk © fl,iXi 
k=l,kj^i 
N 

k=l,k^i 



N 



(6) 



E '^fN-l,kXk © fN~l,iXi 
L k=l,k^i J (jv_i)xl 

{{FiSci) mod (2)} © {{fiXi) mod (2)} , 



where F, 



fi 



fj-l fjH 



is the network sub-encoding matrix of Ui, Xj 



-I T 

Xi ■ ■ ■ Xj+i ■ ■ ■ 



We denote x, 



as the bits obtained by network 



decoding at f/j. In the BMNC decoding, based on Xj can be obtained through 

h = {(F,x,) mod (2)} © {(fiXi) mod (2)} . (7) 
Adding {(fjXj) mod (2)} on both sides of the above equation, dV]) can be rewritten as 

(FiXi) mod (2) = ii® {(f,x,) mod (2)} . (8) 

If the matrix Fj is not full rank, the column vector {(fjXj) mod (2)} does not have — 1 
independent elements so that f/j can not obtain all the information bits of other — 1 users. 
Thus Fj should be full rank, then the inverse matrix F~^ exists. Multiplying F^"^ on the two 
sides of dl]), we have 

X, = {Fri {f, © {{i,Xi) mod (2)}}} mod (2) . (9) 

It can be seen that when Fj is full rank, f/, can decode the information through (|9]l. If Fj is 
not full rank, f/j can not obtain all the information bits of other users. Thus, Fj should be full 
rank for i E [1, N] to ensure that all the users can acquire the information of the other users. 

Then we propose the following design criterion on the BMNC matrix to achieve the informa- 
tion exchange. 

Theorem 1: For f/j, if Fj is full rank, then 

N 

is the necessary and sufficient condition that F^ is full rank for j E [1, A^]. 

Proof: See Appendix A. ■ 
Using Theorem 1, the network coding matrix F can be easily designed through one full rank 
matrix in GF(2). Moreover, in the following. Theorem 1 is used for performance analysis. 



IV. Performance Analysis 

Above, network encoding and decoding protocols are investigated. From it is evident 
that different F results in different system performance. Thus, Theorem 1 is not sufficient for 
further performance analysis and improvement. In this section, the BMNC matrix and the error 
performance of the system will be analyzed. 

A. Network coding matrix analysis 

In what follows, we shall study how the network coding matrix affects error rates at Ui. First, 
we have the following result: 

Theorem 2: Since Ui needs to obtain other — 1 users' information bits, the error vectors 
that user i receives are 

(11) 

= {^i ® Sci) ® {{xi ® Xi) Ijv-ixi) © {(F,-^ (r©fi)) mod (2)}, 



where x. 



T 



refers to the information that Uj wishes to 



Xl ■ ■ ■ Xi-i Xi^i ■ ■ ■ X]\! 

obtain, Iat^ixi is a column vector with A^ — 1 elements which are all 1. 

Proof: See Appendix B. ■ 

B. Exact system performance 

In this subsection, closed-form expressions of the SEP and throughput of the system will be 
derived. 

The SEP of the system: 

First, we shall investigate the addition in GF(2) and give the following lemma which will be 
used for our results later. 

Lemma 3: For addition of Q numbers in GF(2), we have 

x:%=x:(-2r^ E Ha,,., (12) 

9=1 <?=1 1<P1<P2<---<P9<Q j = l 
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where G {0, 1}. 

Proof: See Appendix C. ■ 
Since (fTTl) is not convenient for SEP analysis, (fTTI) should be transformed and the third item 
of (fTTI) can be rewritten as 

{Fri(r©?,)}mod(2) = |[Gi,i G,,2 ■■■ G,,,v-i ] (r © mod (2) 

iv-i (13) 
= J]®(G,„o(r©r,)), 

n=l 

where vector Gj ^ is the /cth column vector of F~^. 
Substituting ([H]) into ([11]), we have 



. n=l 



Xe,i = (X, © ii) © ((X, © Xi) ItV-Ixi) © ( J] ^ ° ® ^^^^ 

E 



Af+1 
n=l 



(14) 



where a^^i = (xj © Xj), ai_2 = {xi © x^) Iat-ixi, aj,„ = Gi,n-2 o (r © f^), for 3 < n < + 1. 
Using Lemma 3, (fT4l) can be expressed as 

TV+l n 

xe, = ^ (-2)^^-^ n^Pr (15) 

n=l l<Pi<V2< - <Pn<N+l j=l 

Based on (fTSl ). using Bayesian formula, the error probability of user i can be calculated as 

Pe,i (F) =E[|Xe,i|] 



Af-1 

(16) 



A:=l 

Af-17V+l n 

EE (-2)"" E 

k=l n=l l<pi<P2< - <Pn<N+l j=l 



11 



E [(a,i).] 



(17) 



where 

E [xk © Xk]-, k <i - 1, 

E[xk+i® Xk+i] , i<k<N-l, 

E [(ai,2)fc] =E[x^® Xi] , 
E [(a,,,),] =E[rk® hA {^i^)k,i-~2 , for 3 < / < iV + 1. 
For BPSK, the relationship between the SEP and the received SNR over Rayleigh fading 
channels is [|32ll 



where ca denotes the error and 7a denotes the average received SNR. Thus, (fTTT i can be rewritten 

as 



E [Mk] 



2 2Vi+7^. ' - ' 



2 2 Y l+7^fe+i 



2 + 

where 7^;^^ is the average received SNR of Uk at the relay and 74^^^ is the average received SNR 
at Ui in time slot k in the second phase. 

Thus the error probability of the system can be expressed as 

1 ^ 

^^(F) = -5^Pe.(F), (20) 

i=l 

where Pg,* (F) is given in (fT6l) . 
The throughput of the system: 

Here we define the throughput as the symbols received correctly at all the users per time slot. 
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Then the throughput of the system with NC is given by 



N (N - 1) (1 - PJF)) 

Tnc = — ^ — l^Jl (21) 

NC 2N -1 ' 



where N denotes the number of the users and Pe (F) is given in (|20l) . 

As the extension of two-way relay network [|33l . for the system without NC, we first derive 
the SEP of the system. For one user, the error probability of other users receiving Xi can be 
expressed as 

N-1 

Pe,,, = {N- l)Pe (Xi) + (1 - Pe (Xi)) (r,„) (22) 

J = l 

where Pg (xj) denotes the error probability of the relay detecting Xj and Pg (r^ j) denotes the 
error probability of Uj detecting rj. In the system without NC, it is evident that rj is Xi. 
Then the SEP of the system without NC can be expressed as 

N ( N-1 

Pe' = 5^ < (iV - l)Pe (X.) + (1 - Pe (x,)) Pe 
i=l I j=l 

N ( / , x , / / , \ \ N-1 



i=l ^ - V , - + lxj 2 V 2 V V 1 + 7x J / ^ V V 1 + 7i,r, 



(23) 

where Eq. (19) is used in the last step. 

The throughput of the system without NC can be easily obtained as 

JiiVo NC — ^ , (^4) 

where Pg' is given in (|23]) . 

In the high SNR region, the throughput of the scheme with NC and without NC can be 
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expressed as 



SNR^oo 

= lim Tnc 



(25) 

2N-1 ^ ^ 



^No NC - ^ lip ^No NC 

N -1 

where T^^^^ and ^yc denote the throughput of the system with and without NC in the high 
SNR region, respectively. 

Based on (l25l) (|26l) . the absolute value of the throughput improvement equals to 

'^A — '^NC '^No NC 

N{N-1) N-1 



T f 1 - TTTT^ I • (27) 



2N -1 
4 V 2N -1 

It can be seen that increases with A^, which indicates that increasing the number of the users 
brings an improved performance of the absolute value of the throughput. 

C. System performance bound 

Above, the exact closed-form expressions of SEP and throughput of the system have been 
derived. However, it can be seen that the expressions are very complex and provide few insights. 
In this subsection, the tight upper bound of SEP of the system will be derived to show useful 
insights. 

Lemma 4: For addition and multiplication in GF(2), we have 

a © 6 < a + 6, (28) 
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(AB) mod (2) < AB, 



(29) 



where A is a (L x M) matrix and B is a column vector with M elements. Symbol a and h, all 
the elements in A and B are in GF(2). 



where in the high SNR region, the probability that two errors occur simultaneously is much 
lower than the probability that only one error occurs. Thus, the condition that more than two 
errors occur simultaneously can be ignored. We note that this simplified bound is still very tight 
as we can see from following simulations. 

Based on (l30l) . the error probability of user i can be calculated as 



Proof: See Appendix D. 



Using Lemma 4, (fTTI) can be upper bounded as 



Xe,i < (xi © 5i) + (liv-ixi {xi © Xi)) + (r © f^)) 




(30) 



Pe. (F)<i5;[|x,^,;|] 



N N-1 



(31) 




Using (|3TI) . the SEP of the system can be upper bounded as 




(32) 



Substituting (fTSl) into (l32l) . the final upper bound of the SEP of the system is 




(33) 
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From the above expression (|33T ). it can be seen that for Ui, the received error probability is made 

N 

up of three parts: the first item ^ P(e^j.) resuks from the transmissions of — 1 users 
except Ui in the first phase, the second item (iV — 1) P (e^..) results from the transmission of Ui 

N-l 

itself in the first phase, the third item ^ P (ci^r,,) \ Gi,k\ results from the transmissions of relay 

k=l 

in the second phase. It is evident that the impacts of the three items on the SEP of the system 
are on the same order of magnitude, since their coefficients are all about — 1. It can also be 
seen that the third item has the largest impact on the SEP of the system, since |Gi,fc| > 1. As 
only one relay is employed to assist the users and the users do not cooperative with each other 
in the system, the diversity order of the proposed scheme is 1. 

D. Designed BMNC encoding matrix 

Above, the connection between the system error performance and the network coding matrix 
is provided. Moreover, several design criteria of the network coding matrix, which ensure the 
successful information exchange, are also investigated. It can be seen from (l33l) that the network 
coding matrix has a significant impact on the system performance. Thus the network coding 
matrix should be designed carefully to further improve the system error performance. 

In practical systems, the distance between the relay and the user varies for different users. 
Then the average received SNRs at the relay for different users may not be the same. Moreover, 
for high order modulations, the power that the relay uses also varies for different symbols. 

It is assumed that the statistical channel state information is known at the relay. Without loss 
of generality, we assume that the statistic channel conditions between the users and the relay 
have an ascending order from Ui to Un, which means that the statistic distance between the 
Ui and the relay is larger than that between the Uj and the relay when i < j. Moreover, we 
assume that the power that the relay uses to broadcast the detected information has a descending 
order from time slot 1 to time slot — 1, which means that the power that the relay uses in 
time slot i is higher than that in time slot j when i < j. In the following, we will show that 
the assumed order of the channel gains and that of the power allocation formulate the designed 
network coding matrix in a more detail. 
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To simplify the network coding design process and save the memory, we propose a puncturing 
operation based network coding matrix design scheme. In this scheme, the designed encoding 
matrix of users is a matrix in which the upper left comer is the designed encoding matrix 
of — 1 users. Thus, the relay only needs to memorize the designed network coding matrix of 
the maximum number users. 

We design an encoding matrix for N users as 



\N users 



1 1 ■■■ 
1 1 ■■■ 



1 ■■■ 1 
the optimality of which will be discussed in Section V. 



(34) 



(Af-l)xAr 



V. The Optimality of the Matrix Given in (134]) 



In this section, we will discuss the optimality of the matrix given in (1341) . First, we consider 
the situation that there are three users. 

Lemma 5: For = 3, the designed network coding matrix F, which minimizes the bound 
of SEP, can be designed as follows 



|3 users 



1 1 
1 1 



(35) 



Proof: See Appendix E. ■ 
In the following, the designed network coding matrices for the systems with more than 
three users will be discussed. We suppose the designed encoding matrix of — 1 users is 
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a (A^ — 2) X (A^ — 1) matrix, which is evaluated by 



F|Ar_l 



1 1 ■■■ 
1 1 ■■■ 

1 ■■■ 1 



(36) 



(Ar-2)x(iV-l) 

Based on this assumption and Lemma 5, if the designed encoding matrix of users is the 
matrix described in (|34] ). using mathematical induction, the matrix given in (|34l ) is the matrix we 
want. Using (|36l ) and the proposed network coding matrix design scheme, the designed encoding 
matrix of N users can be written as 



\N users 



1 1 

1 1 

1 

Ui U2 Us 








bi 
b2 



1 

UN-1 Un 



(37) 



{N-l)xN 

where hi, 62, • • &Af-2, ui, U2, . . un are unknown elements, which are in GF(2). 

Using Theorem 1, each element of the last column vector, bi, b2, ■ ■ ■, bM-2, should be the sum 
of the other elements in its row vector in GF(2). For the first N — 2 row vectors, since there are 
just two "1" elements expect the last column in one row, we have fei = 62 = • • • = &Af-2 = 0. 

Since Fj should be full rank for any i, each column vector of should not be a zero column 
vector. Thus, in the last column vector, un should be 1, since the other elements in this column 
vector are all 0. The designed encoding matrix of users can be rewritten as 



|A'' users 



1 1 

1 1 

1 

Ui U2 Us 







1 

UN-1 1 



(38) 



J (Af-l)x7V 
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where using Theorem 1, we have 



N-l 
k=l 



Uk 



(39) 



Thus, only ui, U2, ■ ■ ., un-i, which are the elements of the last row vector, are left to be 
designed to improve the system error performance. From (l33l . it can be seen that the network 
decoding matrices have considerable impact on the SEP of the system. In the following, we 
need to acquire the network decoding matrix F^^^ for i G [1, A^]. Elementary row operations are 
used to obtain the network decoding matrix. 

For the first user, using elementary row operations, we have 



Fil 



N users 



1 •■■ 
1 ■■■ 





U2 Us 

1 • 
1 • 

• 

• 



1 

un-1 1 



1 •■■ 
1 ■■■ 












1 
1 



1 

1 



U2 Us 



1 
1 




1 

Un-1 1 



(Ar-l)x(2Af-2) 








(40) 



{Ar-l)x{2Af-2) 

where in the last step, the first N — 2 row vectors are multiplied by different coefficient, for 
example the jth row vector is multiplied by n^+i for j < N — 2, and then are all added to the 
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last row vector. From (|40] ). the inverse matrix of Fi is 



1\N users 



1 

1 



U2 U3 







1 

Un-1 1 



Fi. 



(41) 



J (Af-l)x(Ar-l) 

For the second user, using elementary row operations, we have 



2\N users 



1 

1 1 

1 

Ml U3 

1 ■ 
1 • 

■ 

Us ■ 








1 

UN-1 1 








1 

UN-1 1 



1 
1 




1 
1 1 

1 

Ml 







1 
1 





1 
1 



(Af-l)x(2iV-2) 



(42) 



J (Ar-l)x(27V-2) 

where in the last step, the first row vector is added directly to other N — 3 row vectors except the 
last row vector, which is added by the first row vector multiplied by Ui. Then for 2 < j < N — 2, 
multiplying the jth row vector with Uj+i and adding the product to the last row vector, (|42] ) can 
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be rewritten as 



F2|Ar users I 



1 
1 













1 
1 





1 







1 

Un-1 1 



(Af-l)x(2Ar-2) 

(43) 



From (|43T ). the inverse matrix of F2 can be expressed as 



2\N users 





1 













N-1 
k=l 



Uk U3 



1 

Un-1 1 



(44) 



{N-l)x{N~l) 



In the same way as the inverse matrix of Fi and F2, the inverse matrix of Fj for i G [3, — 1] 
can be described as 



i\N users 





1 
1 









U2 M3 







1 















1 



















N-1 



Un-1 1 



(45) 



J (Af-l)x(7V-l) 
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and the inverse matrix of F^r equals to 



N\N users 





1 
1 



1 
1 
1 



••• 1 1 

U2 Ms ■ • • Un-I 1 



(46) 



J {N-l)x(N-l) 



We design a matrix F|7v users, which can be shown in (1341) . In the following, we will prove 
that using F|7v users as the network coding matrix, the upper bound of the SEP of the system is 
minimized. 

Based on (gB, dS]), gS]) and using we have 



^ i^^^ ("^1^ users) Pe ^^l^v users^ 



N-2 / N-1 \ N-2 

k=l \k=l,k^2 J k=2 

N-1 r N-2 / N-1 

+ Yl ^ K ® ^i^k] Uk+i + E [ri_i © fi,i_i] I Y ®^k-l 

j=3 Lfe=l,fc^i-1 \k=l,k^i 
N-2 

+ ^ E [rfc © fN,k]Uk+l- 
k=l 



(47) 
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N-1 

Using (|39l ). we have ^ — 1 = — and (|47l) can be expressed as 



iv(Pe(F|^™)-Pe(F|_ 



lAf users ^ 

N-2 N-2 

= ^E[rk® fi,k]uk+i - E[ri® f2,i] U2 + ^E[rk® r2,k]uk+i 

k=l k=2 

N-1 r N-2 "I Af-2 

+ ^ < 5Z ^[^k® ri^k] Uk+1 - E [ri_i © fi,i_i] Ui \ + ^E[rk® r^^kW+i 

i=3 [k=l,k^i-l J k=l 

Ar-2 Af-2 

= ^ [vk © ri_fe] - E [rfc © rfc+i,fe]) Uk+i + ^ E [r^ © f2,fc]Mfc+i 

fc=l fc=2 
Af-1 Af-2 Af-2 

+ ^ E[rk® fiM] Uk+i + ^E[rk® rN,k]uk+i 

i=3 k=l,k=/=i-l k=l 

>o, 

where E [r^ © fi^k] > E [r^ (B fk+i,k] for k E [1, — 2] is used in the last inequality. From the 
above expression, since the coefficients of m^'s are all strictly positive, it can be seen that the 
requirement of N (^P^ {F\n users) - Pe {^\n «sers)) = is that = 0, A; = 2, 3, . . . , - 1, 
which indicates that only F|jv users meets the requirement. Thus the proposed encoding matrix 
for users is unique. 

Thus F is the designed encoding matrix for users, which minimizes the upper bound of the 
SEP of the system. Based on Lemma 5, using mathematical induction, we have the following 
theorem. 

Theorem 6: The designed network coding matrix of A^ users, which minimizes the bound of 
SEP, is given in (|34l). 

From Theorem 6, it can be seen that the designed network coding matrix is structured and 
sparse. The properties of the matrix simplify the encoding and decoding process while improving 
system performance. 

In practical systems, the relay first needs to know the number of the users. Moreover, the 
statistical information of the user to relay channels should be available to the relay. After receiving 
the necessary information, the designed NC matrix is constructed based on its closed-form 
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expression given by Theorem 6. According to the designed NC matrix, the users will send 
their information to the relay in turn. The relay will detect, encode, and broadcast the received 
information. Finally, each user decodes its received information by exploiting its own information. 

VL Simulation results 

In this section, the performance of the analytical results will be compared with Monte Carlo 
simulations. 

As discussed in the Section IV-C, in practical systems, the received power at the relay for 
different users is different and so does the transmit power of the relay at different time slot. 
The conditions of the simulations are set as follows: the transmit power at different users is the 
same and the average received SNR of Ui at the relay is 3dB worse than that of f/j+i, due to 
the different distances between the users and the relay; the power that the relay uses in time slot 
i is 3dB higher than that in time slot z + 1, since the power gap between two adjacent bits is 
3dB in some high order modulations; the smallest transmit power at the relay and the transmit 
power at one user are the same; the smallest received SNR at the Ui in the second phase of the 
system with NC is denoted as Eg/No in the following figures; BPSK modulation is considered. 
We assume the same total transmit power of the system with and without NC. 

Fig. 2 presents the throughput performance of the system with and without network coding, 
for 4, 5, and 6 users. The "X users with NC" curves are generated by combining (|20l) and 
(|2TI) . and the "X users without NC" curves are generated by combining (|23l) and (l24l) . The NC 
matrices are given by (|34] ). From the figure, it can be seen that compared to the system without 
NC, NC improves the throughput about 0.21, 0.22, 0.23 for 4, 5, 6 users respectively. This is 
predicted by (1271 ). which indicates that as the number of the users increases, the absolute value 
of the throughput improvement increases. Moreover, for 4 users, it can be seen that network 
coding improves the throughput about 14% in the high SNR region, while the improvement is 
about 11% for 5 users and about 9% for 6 users. For a practical communication service, such 
as video conference in which each user may want to have a discussion with the other users, the 
number of users is usually limited, e.g., 3 or 4 people when using Damaka™ [|34l . Furthermore, 
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it is natural that the error performance gain will decrease as the number of users increases for 
any MAC strategy. This indicates that our proposed scheme can improve the throughput of the 
system much in widely used scenarios. 

In Fig. 3, the simulated and tight upper bounds of the SEP performance of the system with 
network coding are compared for different number of users. The "numerical" curves are generated 
by (|33l) and the NC matrices are given by (|34] ). We can see that the numerical SEP curves 
accurately predict the simulation ones. From the figure, the SEP performance is slightly higher 
as the number of users increases. It means that the interference between the users is small, 
which is caused by the network coding at the relay. Thus when the number of the users is large, 
NC is still efficient in improving the throughput with not so much impact on the system error 
performance. 

In Fig. 4, the SEP performance of the system is compared with different network coding 
matrix, for 4 users, where the matrices 1, 2, 3 are, respectively. 



11 
10 1 
10 1 



11 
110 
110 



10 1 
10 10 
11 



It can be seen that our designed matrix can improve the SEP performance compared to an ad-hoc 
coding matrix. However, compared to the system without NC, our proposed scheme achieves a 
slightly poor SEP performance. The above simulations assume the same channel conditions of 
the source to relay channel and the corresponding relay to source channel. However, in Fig. 5, 
we will show that our proposed scheme may achieve a better SEP performance in other channel 
conditions. 

In Fig. 5 when the source to relay link is 20dB better than the corresponding relay to source 
link, it can be seen that our proposed scheme achieves about 2dB SEP gain compared to the 
system without NC. This indicates that our proposed scheme can improve both the throughput and 
the error performance in the system where the source to relay link is better than the corresponding 
relay to source link, such as one satellite assists the information exchange of several base stations. 
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VII. Conclusions 

We have investigated the design of binary linear NC for A^-way relay networks, where 
end nodes exchange their information with the assistance of one A^-way relay. NC matrix in 
GF(2) was proposed to describe the linear NC process. The design criteria of the NC matrix, 
which improve the SEP performance, were provided. Moreover, the closed form expressions and 
the upper bound of SEP of the system were given. It can be seen that using linear NC, the 
throughput gain of the system is more than 10% for less than 6 users. To improve the system 
performance further, we designed NC matrices for arbitrary number of users, which minimized 
the bound of SEP 

Appendix A 
Proof of Theorem 1 

First, we need to prove that it is the necessary condition. We suppose that all the network 
sub-encoding matrices are full rank. Since Fj is full rank, the — 1 column vectors of Fj form a 
(A^ — 1) -dimensional linear vector space in GF(2) and any other column vector in GF(2) which 
contains A^ — 1 elements is in this linear vector space. Then can be expressed as a kind of 
linear combining of the column vectors of Fj. That is 

k=l,k^i 

where (3^ can only be or 1. For any user j, j ^ i, if (3j in (|49l ) is 0, then F^ only has no more 
than A^ — 2 linearly independent column vectors. Thus, Fj is not full rank. That is a contradiction 
of the hypothesis. So (3k in (|49l) should be 1, for j E [1, N] ,j ^ i. Thus we have proved that it 
is the necessary condition. 

Second, we need to prove that it is the sufficient condition. For j,j ^ i, Fj can be spread as 



F 



fl ■■■ f._i f. f.+i ■■■ f,_i f, + i ■■■ fA. 

N 1 (50) 

fl ■ ■ ■ fj-l Yl ®ffe fj+l ■ ■ ■ fj'-l fj'+l ■ ■ ■ f Af , 
k=l,k^i J (Ar_l)x(Ar-l) 
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where ([10] ) is used in the last step. Then we add all the column vectors except the ith column 
vector onto the ith column in GF(2), which is the elementary column operation on Fj. Thus Fj 
can be expressed as 



fl ■ ■ ■ fi-l fj fj+1 ■ ■ ■ fj-1 fj+l ■ • • 
fl ■ ■ ■ fj-l fi+1 ■ ■ ■ fj-1 fj fj+1 ■ ■ • 



(Af-l)x(Af-l) 
(7V-l)x(7V-l) 



(51) 



From (pTI) . it can be seen that Fj has the same rank of Fj, which means that Fj is full rank if 
Fj is full rank. Thus we have proved that it is the sufficient condition and the theorem is proved. 



Appendix B 
Proof of Theorem 2 

Using ©J Eq. (|9]) can be rewritten as 

Xj = {F^i {r © r © f j © {(fjXj) mod (2)}}} mod (2) 
= {F-^ (r © {(fjX,) mod (2)} © (r © fj))} mod (2) 



Fri 



{(Fi) mod (2)} © {(fjX,) mod (2)} © (r © ?j 



V 



I 



The item 1\ in the above equation can be expressed as 



r k=N 

k=l 
k=N 

E ®f2,kXk 
II = k=l 



k=N 
k=l 



f2,iXi 
fN-l,iXi 



(52) 



mod (2) 



(53) 



(Ar-l)xl 



(Af-l)xl 
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and the above expression can be rewritten as 



k=N 

k=l,kj^i 
k=N 

k=l,k^i 



h 



k=N 

-l,kXk 

k=l,k^i 



VE? -X/^ 



(54) 



{Ar-l)xl 



(Ar-l)xl 

= {(F,5,) mod (2)} © {(f, {xi © X,)) mod (2)} , 
substituting (|54l ) into (|521 ). (|52l ) can be written as 

X, = {Fri {{(F,ii) mod (2)} © {(f, {xi © x,)) mod (2)} © (r © F^)}} mod (2) . (55) 

Since GF(2) is Galois field, distributive law of multiplication exists. Using distributive law of 
multiplication, (1551) equals to 

Xi = {(F-^Fiii) mod (2)} © {(Frif,(a;,©5,)) mod (2)}©{(Fri (r © f,)) mod (2)} 
= © { (Frif, (x, © X,)) mod (2)} © { (Fr^ (r © f^)) mod (2)} . 



(56) 



Using Theorem 1, since Fj should be full rank, we have 



N 



mod (2) 



k=l,ky^i 

fl ... fj_l fj+l ... f/vT 

= (Fajv-ixi) mod (2). 
Based on (|57]) . Eq. (|56l) can be expressed as 

Xi = 5, © ((xi © X,) l^_ixi) © { (F-^ (r © Vi)) mod (2)} 



(57) 



(58) 



where F^ ^Fj = I is used. Thus the theorem is proved. 
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Appendix C 
Proof of Lemma 3 

Mathematical induction is proposed to prove the theorem. First, consider the addition of two 
numbers in GF(2), 

a®b = a + b- 2ab. (59) 
Then we assume that the addition of Q numbers in GF(2) can be written as 

f;% = f;(-2r^ Y: fla,, (60) 

q=l 9=1 1<P1<P2<---<P9<Q j = l 

Using (|59l ). the addition of Q + 1 numbers in GF(2) can be expressed as 



-1 

g=l q=l 

Q Q 

= ag+i + ^ ®aq - 2aQ+i ^ ®ag, 

q=l 9=1 

and taking (|60l ) into the above expression, (1611 can be rewritten as 

Q+l Q Q q 

9=1 9=1 9=1 l<Pi<P2< - <Pg<Q i=l 



(61) 



9 



(62) 



9=1 l<Pl<P2< - <Pg<Q i = l 



Separate the q = 1 term from the second item of (|62] ) and separate the g = Q + 1 term from 
the last item of (|621 ). we have 



Q+l Q Q 9 

9=1 pi 9=2 l<Pi<P2<-<Pg<Qi=l 

(63) 

Q 9 Q+l 



+5: (-2)'- E n««+(-2)''n 

9=2 l<pi<p2<-<Pg<Q+l j=l pi=l 



Next, combining the first item and second item of (|64] ). and combining the third item and 
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forth item of (|64] ). the above expression can be reformulated as 

Q+1 Q+1 Q+1 q 



HI 



q=l pi q=2 1<P1<P2<-<P9<Q+1 i = l 

Q+1 q 

g=i i<pi<p2<-<P9<Q+i j=i 



Pl=l 



(64) 



From the above derivation, (fT2]) is convenient for the addition of two numbers in GF(2). 
Moreover, based on the addition of Q numbers in GF(2), (fT2] ) is also convenient for the addition 
of Q + 1 numbers in GF(2). Thus, using mathematical induction, for arbitrary numbers, the 
addition in GF(2) can be expressed as (fT2] ) and Lemma 3 is proved. 



Appendix D 
Proof of Lemma 4 

Eq. (|28] ) is obvious and we focus on (|29l) . We define that [A]j j = j and [B] 
the multiplication between matrix A and B in GF(2) can be expressed as 



h.:. Then 



(AB) mod (2) = 


- M 

i=l 

M 

i=l 


< 


- M 

E «l,i^i 
i=l 
M 

E '^2,i&i 
i=l 




M 

E ®(^L,ihi 

- i=l 


Mxl 


M 

E (^L,A 

- i=l 



AB. 



(65) 



Mxl 



where a®h < a + h h used in the inequality. 
Thus we have (|29l ) and the lemma is proved. 
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Appendix E 
Proof of Lemma 5 

For = 3, from (|32] ). the error probability of the system in this situation can be calculated 



as 



(66) 



3Pe < 4 [Xi ®Xi]+E [X2 ®X2]+E [X3 © X3]) 

+ (/2,2 + /2,3) E [n © h,l] + (/l,2 + /l,3) E [r2 © ^,2] 
+ (/2,1 + /2,3) E [n © r2,i] + + /i,3) E [r2 © f2,2] 

+ (/2,1 + /2,2) E [n © f3,l] + + /l,2) ^ [r2 © h,2] 

= 4 ((Xi © Xi) + {X2 © X2) + ix3 © X3)) 

+ {E [r2 © r2,2] + ^ h © f3,2]) + [ri © f2,i] + ^ [^i © r3,i]) /2,i 
+ [r2 © fi,2] + ^ h © ^3,2]) /i,2 + {E [ri © fi,i] + E [n © f3,l]) /2,2 
+ [r2 © fi,2] + ^ h © f2,2]) /i,3 + {E [n © ri,i] +E[n® ^2,1]) /2,3- 



To ensure that each user can obtain other two users' information, the column vector of the 
encoding matrix F can only be [1, 1]^, [0, 1]^ and [1, 0]^. Using the assumptions of the statistic 
channel conditions between the users and the relay and the power that the relay uses to broadcast 
the detected information, we have 



E [ri © fi,i 
E [r2 © fi,2 
E [r2 © ri,2 
E [r2 © r2,2 
E [r2 © f3,2 



> £^ [ri © f2,i] > £^ [ri © f3,i] 

> E[r2® f2,2] > ^ [r2 © f3,2] 

> E[ri® h,i] , 

> E [n © f2,i] , 

> E[ri® f3,i] . 



(67) 
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From the above relationship, it can be seen that 

E [ri © +E[ri® 

< min {E [r2 © ^,2] + E\r2® ^,2], E [ri © fi,i\ + E[ri® f^^) , 
max {E [r2 © ^,2] + ^ ^ © rg^s], ^ [n © +E[ri® r^^) , 

< min {E [rg © f^a] + E[r2® f^^, E [ra © ^,2] + ^ ^ © rg^a], ^ [ri © ri,i] +E[ri® ^2,1]) • 

(68) 

Thus, E [ri © f2,i] + E[ri(B fs^i] is the smallest one and to minimize the SEP of the system, 
in (|66l ). the coefficient of E[ri © r2,i] + E[ri © r3,i] should be largest and then we have /2,i = 1. 
Moveover, E [r2 © r2,2] + -E [r2 © and E [ri © fi^i] + [ri © fs^i] are smaller than the rest, 
which result in /i^i = /2,2 = 1- Thus (|35l) is proved. 
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Fig. 1. System model. 
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Fig. 2. Comparisons between the throughput of the system with and without network coding, for 4, 5, and 6 users. 
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Fig. 5. The SEP of the system with different network coding matrix when the source to relay link is 20dB better than the 
corresponding relay to source link, for 4 users. 



